Reduction Optimization in Heterogeneous Cluster Environments
نویسندگان
چکیده
Network of workstation (NOW) is a cost-e ective alternative to massively parallel supercomputers. As commercially available o -the-shelf processors become cheaper and faster, it is now possible to build a cluster that provides high computing power within a limited budget. However, a cluster may consist of di erent types of processors and this heterogeneity complicates the design of e cient collective communication protocols. For example, it is a very hard combinatorial problem to nd an optimal reduction schedule for such heterogeneous clusters. Nevertheless, we show that a simple technique called slowest-noderst (SNF) is very e ective in designing e cient reduction protocols for heterogeneous clusters. First, we show that SNF is actually an approximation algorithm with competitive ratio two. In addition, we show that SNF does give the optimal reduction time when the cluster consists of two types of processors, and the ratio of communication speed between them is at least two.
منابع مشابه
Adaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments
Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and a same workload is assigned to each node. Default Hadoop d...
متن کاملCollective Communication Optimization with Dynamic Programming in Heterogeneous Cluster Environments
Network of workstation (NOW) is a cost-effective alternative to massively parallel supercomputers. However, the heterogeneity complicates the design of efficient collective communication protocols. Nevertheless, we demonstrate that a complex reduction scheduling problem can be transformed into a tree path length optimization problem and we develop dynamic programming techniques to solve this pr...
متن کاملParallel asynchronous particle swarm optimization.
The high computational cost of complex engineering optimization problems has motivated the development of parallel optimization algorithms. A recent example is the parallel particle swarm optimization (PSO) algorithm, which is valuable due to its global search capabilities. Unfortunately, because existing parallel implementations are synchronous (PSPSO), they do not make efficient use of comput...
متن کاملCost Minimization Heuristics for Scheduling Workflows on Heterogeneous Distributed Environments
Many large scale scientific problems require computing power that goes beyond the capabilities of a single machine. The data and compute requirements of these problems demand a high performance computing environment such as a cluster, a grid or a cloud platform in order to be solved in a reasonable amount of time. In order to efficiently execute workflows and utilize the distributed resources i...
متن کاملMulti-criteria genetic algorithm applied to scheduling in multi-cluster environments Open
Scheduling and resource allocation to optimize performance criteria in multi-cluster heterogeneous environments is known as an NP-hard problem, not only for the resource heterogeneity, but also for the possibility of applying co-allocation to take advantage of idle resources across clusters. A common practice is to use basic heuristics to attempt to optimize some performance criteria by treatin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000